A Fast Method for Property Prediction in Graph-Structured Data from Positive and Unlabelled Examples
نویسندگان
چکیده
The analysis of large and complex networks, or graphs, is becoming increasingly important in many scientific areas including machine learning, social network analysis and bioinformatics. One natural type of question that can be asked in network analysis is “Given two sets R and T of individuals in a graph with complete and missing knowledge, respectively, about a property of interest, which individuals in T are closest to R with respect to this property?”. To answer this question, we can rank the individuals in T such that the individuals ranked highest are most likely to exhibit the property of interest. Several methods based on weighted paths in the graph and Markov chain models have been proposed to solve this task. In this paper, we show that we can improve previously published approaches by rephrasing this problem as the task of property prediction in graph-structured data from positive examples, the individuals in R, and unlabelled data, the individuals in T , and applying an inexpensive iterative neighbourhood’s majority vote based prediction algorithm (“iNMV”) to this task. We evaluate our iNMV prediction algorithm and two previously proposed methods using Markov chains on three real world graphs in terms of ROC AUC statistic. iNMV obtains rankings that are either significantly better or not significantly worse than the rankings obtained from the more complex Markov chain based algorithms, while achieving a reduction in run time of one order of magnitude on large graphs.
منابع مشابه
Link Prediction using Network Embedding based on Global Similarity
Background: The link prediction issue is one of the most widely used problems in complex network analysis. Link prediction requires knowing the background of previous link connections and combining them with available information. The link prediction local approaches with node structure objectives are fast in case of speed but are not accurate enough. On the other hand, the global link predicti...
متن کاملPrediction of Electrofacies Based on Flow Units Using NMR Data and SVM Method: a Case Study in Cheshmeh Khush Field, Southern Iran
The classification of well-log responses into separate flow units for generating local permeability models is often used to predict the spatial distribution of permeability in heterogeneous reservoirs. The present research can be divided into two parts; first, the nuclear magnetic resonance (NMR) log parameters are employed for developing a relationship between relaxation time and reservoir poro...
متن کاملSolubility Prediction of Anthracene in Non-Aqueous Solvent Mixtures Using Jouyban-Acree Model
A quanitative structure property relationship was proposed to calculate the binary interaction terms of the Jouyban-Acree model using solubility parameter, boiling point, vapour pressure and density of solvents. The applicability of the proposed method for reproducing solubility data of anthracene in binary solvents has been evaluated using 116 solubility data sets collected from the lite...
متن کاملSome Statistical Methods for Prediction of Athletic Records
Prediction of the sports records has received a great deal of attention from researchers in different disciplines. This article reviews some of the methods developed by statisticians and offers few improvements. Specific methods discussed include trend analysis, tail modeling, and methods based on certain results of the theory of records for independent and identically distributed attempts. To ...
متن کاملA Comparison of the Efficacy, Adverse Effects, and Patient Compliance of the Sena-Graph Syrup and Castor Oil Regimens for Bowel Preparation
Sena-Graph syrup has recently been formulated by an Iranian pharmaceutical company for being used in bowel evacuation before radiography, colonoscopy and surgery. This study compares the efficacy, adverse effects and patient compliance of two bowel preparation regimens with castor oil and Sena-Graph syrup in of outpatients for Intravenous Urography (IVU). One hundred and fourteen consecutive ou...
متن کامل